Skip to content

Handle null lanes in primitive take#8505

Merged
gatesn merged 1 commit into
developfrom
ngates/primitive-take-null-indices
Jun 19, 2026
Merged

Handle null lanes in primitive take#8505
gatesn merged 1 commit into
developfrom
ngates/primitive-take-null-indices

Conversation

@gatesn

@gatesn gatesn commented Jun 19, 2026

Copy link
Copy Markdown
Contributor

Avoid dereferencing physical index values from null take lanes by zeroing those lanes before the cast/gather and intersecting the original index validity back into the result validity.

Avoid dereferencing physical index values from null take lanes by zeroing those lanes before the cast/gather and intersecting the original index validity back into the result validity.

Signed-off-by: Nicholas Gates <nick@nickgates.com>
@gatesn gatesn requested review from a team and robert3005 June 19, 2026 11:59
@gatesn gatesn added the changelog/fix A bug fix label Jun 19, 2026
@gatesn gatesn enabled auto-merge (squash) June 19, 2026 12:00
@gatesn gatesn merged commit 575db9c into develop Jun 19, 2026
72 of 73 checks passed
@gatesn gatesn deleted the ngates/primitive-take-null-indices branch June 19, 2026 12:06
@codspeed-hq

codspeed-hq Bot commented Jun 19, 2026

Copy link
Copy Markdown

Merging this PR will degrade performance by 11%

⚠️ Unknown Walltime execution environment detected

Using the Walltime instrument on standard Hosted Runners will lead to inconsistent data.

For the most accurate results, we recommend using CodSpeed Macro Runners: bare-metal machines fine-tuned for performance measurement consistency.

⚠️ Different runtime environments detected

Some benchmarks with significant performance changes were compared across different runtime environments,
which may affect the accuracy of the results.

Open the report in CodSpeed to investigate

⚡ 2 improved benchmarks
❌ 7 regressed benchmarks
✅ 1572 untouched benchmarks

Warning

Please fix the performance issues or acknowledge them on CodSpeed.

Performance Changes

Mode Benchmark BASE HEAD Efficiency
Simulation decompress_rd[f64, (10000, 0.01)] 108.7 µs 139.1 µs -21.89%
Simulation decompress_rd[f64, (10000, 0.1)] 109 µs 139.5 µs -21.85%
Simulation decompress_rd[f64, (10000, 0.0)] 108.7 µs 139.1 µs -21.83%
Simulation decompress_rd[f32, (100000, 0.0)] 496 µs 583.8 µs -15.05%
Simulation decompress_rd[f32, (10000, 0.1)] 78.1 µs 91.2 µs -14.43%
Simulation decompress_rd[f32, (10000, 0.01)] 78.1 µs 91 µs -14.2%
Simulation decompress_rd[f32, (10000, 0.0)] 78.5 µs 91.2 µs -13.91%
Simulation chunked_varbinview_opt_canonical_into[(1000, 10)] 206.8 µs 170.2 µs +21.46%
Simulation chunked_varbinview_into_canonical[(100, 100)] 307.1 µs 272.8 µs +12.59%

Tip

Investigate this regression by commenting @codspeedbot fix this regression on this PR, or directly use the CodSpeed MCP with your agent.


Comparing ngates/primitive-take-null-indices (c313c1f) with develop (35e4d72)

Open in CodSpeed

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

changelog/fix A bug fix

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants